Section: Importing modules

In [1]:
import pandas as pd
import numpy as np
import matplotlib
import matplotlib.pyplot as plt
from IPython.display import display
import IPython.core.display as di
from include.config import CleaningConfig, PlottingConfig, NotebookConfig
from include.counting import get_count, get_percentage
from include.plotting import get_plot, display_side_by_side
from include.likertScalePlot import likert_scale
from include.textCleaning import wordcloud
In [2]:
pd.set_option('display.max_rows', 1000)
pd.set_option('display.max_columns', 1000) 

This text is used to hide the code cell when exported in html

In [3]:
di.display_html('<script>jQuery(function() {if (jQuery("body.notebook_app").length == 0) { jQuery(".input_area").toggle(); jQuery(".prompt").toggle();}});</script>', raw=True)
In [4]:
get_ipython().magic('matplotlib inline')  # Activate that line to use in Jupyter 
matplotlib.rcParams['figure.figsize'] = (15.0, 8.0)

Loading dataset

In [5]:
df =  pd.read_csv('./uk_2017/data/cleaned_data.csv')

The total of participant after cleaning the dataset. Remove all participants that have not completed the survey passed the first pages and all participants who are not from the country

In [6]:
len(df)
Out[6]:
253

Section: 1

Group of question: socio

In which country do you work?

In [7]:
v_to_count, filename  = get_count(df, ['socio1. In which country do you work?'], "one choice", "./../survey_creation/uk_17/listAnswers/countries.csv", "False", "./uk_2017/output/")
In [8]:
perc_to_count = get_percentage(v_to_count, filename)
In [9]:
display_side_by_side(v_to_count,perc_to_count)
In which country do you work?
United Kingdom 253
In which country do you work? [PERCENTAGE]
United Kingdom [PERCENTAGE] 100.0
In [10]:
_ = get_plot(perc_to_count, "one choice")

Group of question: edu

What is the highest qualification you have obtained?

In [11]:
v_to_count, filename  = get_count(df, ['edu1. What is the highest qualification you have obtained?'], "one choice", "./../survey_creation/uk_17/listAnswers/education.csv", "True", "./uk_2017/output/")
In [12]:
perc_to_count = get_percentage(v_to_count, filename)
In [13]:
display_side_by_side(v_to_count,perc_to_count)
What is the highest qualification you have obtained?
Undergraduate degree 170
Master degree 56
Doctorate 25
Other 2
What is the highest qualification you have obtained? [PERCENTAGE]
Undergraduate degree 67.19
Master degree 22.13
Doctorate 9.88
Other 0.79
In [14]:
_ = get_plot(perc_to_count, "one choice")

In which subject is your highest academic qualification?

In [15]:
v_to_count, filename  = get_count(df, ['edu2. In which subject is your highest academic qualification?'], "one choice", "./../survey_creation/uk_17/listAnswers/academic_field.csv", "False", "./uk_2017/output/")
In [16]:
perc_to_count = get_percentage(v_to_count, filename)
In [17]:
display_side_by_side(v_to_count,perc_to_count)
In which subject is your highest academic qualification?
Physics and Astronomy 69
Computer Science 69
Mathematics 19
Chemistry 16
Biological Sciences 15
Other 12
Geography & Environmental Sciences 10
Electrical & Electronic Engineering 8
Civil Engineering 6
Mechanical Engineering 4
Materials Technology 3
Linguistics 3
Geology 3
General Engineering 3
Psychology 3
Aeronautical & Manufacturing Engineering 2
Medicine 2
History 1
Robotics 1
Art & Design 1
Theology & Religious Studies 1
NaN 2
In which subject is your highest academic qualification? [PERCENTAGE]
Physics and Astronomy 27.49
Computer Science 27.49
Mathematics 7.57
Chemistry 6.37
Biological Sciences 5.98
Other 4.78
Geography & Environmental Sciences 3.98
Electrical & Electronic Engineering 3.19
Civil Engineering 2.39
Mechanical Engineering 1.59
Materials Technology 1.20
Linguistics 1.20
Geology 1.20
General Engineering 1.20
Psychology 1.20
Aeronautical & Manufacturing Engineering 0.80
Medicine 0.80
History 0.40
Robotics 0.40
Art & Design 0.40
Theology & Religious Studies 0.40
In [18]:
_ = get_plot(perc_to_count, "one choice")

Enter your academic subject

In [19]:
 wc = wordcloud(df, ['edu3. Enter your academic subject'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb91dda390>
In [20]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[20]:
(-0.5, 1499.5, 799.5, -0.5)

Do you hold any other professional qualifications?

In [21]:
 wc = wordcloud(df, ['edu4. Do you hold any professional qualifications?'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb91edb630>
In [22]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[22]:
(-0.5, 1499.5, 799.5, -0.5)

Group of question: rse

Do you write code as part of your job?

In [23]:
v_to_count, filename  = get_count(df, ['rse1. Do you write code as part of your job?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [24]:
perc_to_count = get_percentage(v_to_count, filename)
In [25]:
display_side_by_side(v_to_count,perc_to_count)
Yes No
Do you write code as part of your job? 246 7
Yes No
Do you write code as part of your job? [PERCENTAGE] 97.23 2.77
In [26]:
_ = get_plot(perc_to_count, "y/n/na")

Who uses the code that you write?

In [27]:
v_to_count, filename  = get_count(df, ['rse3[SQ001]. Who uses the code that you write? []'], "likert", "./../survey_creation/uk_17/listAnswers/likert_usage_5.csv", "False", "./uk_2017/output/")
In [28]:
display(v_to_count) 
0 - Mostly me 1 2 3 4 5 - Mostly other people
Who uses the code that you write? [] 7 34 31 38 60 76
In [29]:
_ = get_plot(v_to_count, "likert")

Group of question: soft

Do you consider yourself a professional software developer?

In [30]:
v_to_count, filename  = get_count(df, ['soft2can. Do you consider yourself a professional software developer?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [31]:
perc_to_count = get_percentage(v_to_count, filename)
In [32]:
display_side_by_side(v_to_count,perc_to_count)
Yes No
Do you consider yourself a professional software developer? 147 106
Yes No
Do you consider yourself a professional software developer? [PERCENTAGE] 58.1 41.9
In [33]:
_ = get_plot(perc_to_count, "y/n/na")

How many years of Software development experience do you have?

In [34]:
v_to_count, filename  = get_count(df, ['soft1can. How many years of software development experience do you have?'], "freenumeric", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [35]:
perc_to_count = get_percentage(v_to_count, filename)
In [36]:
display_side_by_side(v_to_count,perc_to_count)
How many years of software development experience do you have?
0 15.0
1 9.0
2 NaN
3 20.0
4 14.0
5 25.0
6 NaN
7 18.0
8 NaN
9 NaN
10 12.0
11 16.0
12 7.0
13 5.0
14 20.0
15 15.0
16 9.0
17 30.0
18 15.0
19 NaN
20 30.0
21 15.0
22 13.0
23 7.0
24 10.0
25 5.0
26 5.0
27 10.0
28 NaN
29 19.0
30 NaN
31 5.0
32 10.0
33 3.0
34 NaN
35 NaN
36 4.0
37 NaN
38 15.0
39 8.0
40 12.0
41 4.0
42 25.0
43 NaN
44 10.0
45 3.0
46 18.0
47 7.0
48 15.0
49 NaN
50 10.0
51 NaN
52 7.0
53 3.0
54 5.0
55 NaN
56 17.0
57 NaN
58 NaN
59 NaN
60 10.0
61 NaN
62 NaN
63 NaN
64 NaN
65 1.0
66 NaN
67 11.0
68 16.0
69 11.0
70 NaN
71 17.0
72 20.0
73 4.0
74 16.0
75 11.0
76 3.0
77 NaN
78 NaN
79 18.0
80 17.0
81 NaN
82 NaN
83 35.0
84 NaN
85 15.0
86 15.0
87 40.0
88 NaN
89 NaN
90 NaN
91 20.0
92 NaN
93 NaN
94 12.0
95 NaN
96 NaN
97 NaN
98 8.0
99 15.0
100 12.0
101 10.0
102 NaN
103 17.0
104 NaN
105 NaN
106 NaN
107 30.0
108 NaN
109 NaN
110 NaN
111 12.0
112 18.0
113 11.0
114 2.0
115 6.0
116 NaN
117 27.0
118 11.0
119 NaN
120 NaN
121 20.0
122 18.0
123 27.0
124 6.0
125 10.0
126 NaN
127 5.0
128 12.0
129 NaN
130 13.0
131 20.0
132 10.0
133 NaN
134 NaN
135 20.0
136 30.0
137 NaN
138 6.0
139 NaN
140 10.0
141 NaN
142 2.0
143 15.0
144 NaN
145 20.0
146 NaN
147 NaN
148 NaN
149 13.0
150 NaN
151 NaN
152 NaN
153 14.0
154 NaN
155 NaN
156 NaN
157 7.0
158 NaN
159 NaN
160 NaN
161 17.0
162 34.0
163 17.0
164 8.0
165 20.0
166 NaN
167 9.0
168 NaN
169 NaN
170 NaN
171 12.0
172 NaN
173 20.0
174 8.0
175 NaN
176 10.0
177 NaN
178 NaN
179 12.0
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 NaN
186 20.0
187 21.0
188 20.0
189 7.0
190 NaN
191 16.0
192 4.0
193 3.0
194 20.0
195 20.0
196 NaN
197 NaN
198 12.0
199 NaN
200 NaN
201 NaN
202 7.0
203 NaN
204 NaN
205 6.0
206 NaN
207 10.0
208 NaN
209 NaN
210 NaN
211 NaN
212 9.0
213 NaN
214 NaN
215 15.0
216 NaN
217 35.0
218 NaN
219 5.0
220 5.0
221 NaN
222 15.0
223 25.0
224 NaN
225 5.0
226 9.0
227 10.0
228 7.0
229 NaN
230 11.0
231 10.0
232 NaN
233 NaN
234 20.0
235 5.0
236 26.0
237 NaN
238 14.0
239 14.0
240 NaN
241 25.0
242 NaN
243 25.0
244 10.0
245 20.0
246 3.0
247 20.0
248 5.0
249 30.0
250 NaN
251 20.0
252 10.0
How many years of software development experience do you have? [PERCENTAGE]
0 NaN
1 NaN
2 NaN
3 NaN
4 NaN
5 NaN
6 NaN
7 NaN
8 NaN
9 NaN
10 NaN
11 NaN
12 NaN
13 NaN
14 NaN
15 NaN
16 NaN
17 NaN
18 NaN
19 NaN
20 NaN
21 NaN
22 NaN
23 NaN
24 NaN
25 NaN
26 NaN
27 NaN
28 NaN
29 NaN
30 NaN
31 NaN
32 NaN
33 NaN
34 NaN
35 NaN
36 NaN
37 NaN
38 NaN
39 NaN
40 NaN
41 NaN
42 NaN
43 NaN
44 NaN
45 NaN
46 NaN
47 NaN
48 NaN
49 NaN
50 NaN
51 NaN
52 NaN
53 NaN
54 NaN
55 NaN
56 NaN
57 NaN
58 NaN
59 NaN
60 NaN
61 NaN
62 NaN
63 NaN
64 NaN
65 NaN
66 NaN
67 NaN
68 NaN
69 NaN
70 NaN
71 NaN
72 NaN
73 NaN
74 NaN
75 NaN
76 NaN
77 NaN
78 NaN
79 NaN
80 NaN
81 NaN
82 NaN
83 NaN
84 NaN
85 NaN
86 NaN
87 NaN
88 NaN
89 NaN
90 NaN
91 NaN
92 NaN
93 NaN
94 NaN
95 NaN
96 NaN
97 NaN
98 NaN
99 NaN
100 NaN
101 NaN
102 NaN
103 NaN
104 NaN
105 NaN
106 NaN
107 NaN
108 NaN
109 NaN
110 NaN
111 NaN
112 NaN
113 NaN
114 NaN
115 NaN
116 NaN
117 NaN
118 NaN
119 NaN
120 NaN
121 NaN
122 NaN
123 NaN
124 NaN
125 NaN
126 NaN
127 NaN
128 NaN
129 NaN
130 NaN
131 NaN
132 NaN
133 NaN
134 NaN
135 NaN
136 NaN
137 NaN
138 NaN
139 NaN
140 NaN
141 NaN
142 NaN
143 NaN
144 NaN
145 NaN
146 NaN
147 NaN
148 NaN
149 NaN
150 NaN
151 NaN
152 NaN
153 NaN
154 NaN
155 NaN
156 NaN
157 NaN
158 NaN
159 NaN
160 NaN
161 NaN
162 NaN
163 NaN
164 NaN
165 NaN
166 NaN
167 NaN
168 NaN
169 NaN
170 NaN
171 NaN
172 NaN
173 NaN
174 NaN
175 NaN
176 NaN
177 NaN
178 NaN
179 NaN
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 NaN
186 NaN
187 NaN
188 NaN
189 NaN
190 NaN
191 NaN
192 NaN
193 NaN
194 NaN
195 NaN
196 NaN
197 NaN
198 NaN
199 NaN
200 NaN
201 NaN
202 NaN
203 NaN
204 NaN
205 NaN
206 NaN
207 NaN
208 NaN
209 NaN
210 NaN
211 NaN
212 NaN
213 NaN
214 NaN
215 NaN
216 NaN
217 NaN
218 NaN
219 NaN
220 NaN
221 NaN
222 NaN
223 NaN
224 NaN
225 NaN
226 NaN
227 NaN
228 NaN
229 NaN
230 NaN
231 NaN
232 NaN
233 NaN
234 NaN
235 NaN
236 NaN
237 NaN
238 NaN
239 NaN
240 NaN
241 NaN
242 NaN
243 NaN
244 NaN
245 NaN
246 NaN
247 NaN
248 NaN
249 NaN
250 NaN
251 NaN
252 NaN
In [37]:
_ = get_plot(v_to_count, "freenumeric")
       How many years of software development experience do you have?
count                                         147.000000             
mean                                           13.707483             
std                                             7.823578             
min                                             1.000000             
25%                                             8.000000             
50%                                            12.000000             
75%                                            18.500000             
max                                            40.000000             

Group of question: time

In an average month, how much time do you spend on software development (Please rate your answer between 1 to 10. 1 Being none at all and 10 being All your time.)

In [38]:
v_to_count, filename  = get_count(df, ['time1can[time1can]. In an average month, how much time do you spend on: [Software development]', 'time1can[time2can]. In an average month, how much time do you spend on: [Research]', 'time1can[time3can]. In an average month, how much time do you spend on: [People management]', 'time1can[time6can]. In an average month, how much time do you spend on: [Project management]', 'time1can[time4can]. In an average month, how much time do you spend on: [Teaching]', 'time1can[time5can]. In an average month, how much time do you spend on: [Other activities]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_time_10.csv", "False", "./uk_2017/output/")
In [39]:
display(v_to_count) 
1 (None at all) 2 3 4 5 6 7 8 9 10 (All my time)
Software development 5.0 16.0 24.0 35.0 35.0 24.0 50.0 41.0 16.0 7.0
Research 45.0 58.0 37.0 29.0 25.0 20.0 20.0 10.0 7.0 2.0
People management 102.0 76.0 33.0 16.0 6.0 7.0 7.0 5.0 NaN 1.0
Project management 32.0 86.0 58.0 23.0 26.0 11.0 9.0 4.0 2.0 2.0
Teaching 109.0 77.0 33.0 17.0 6.0 8.0 1.0 NaN 1.0 1.0
Other activities 52.0 88.0 40.0 29.0 19.0 10.0 10.0 NaN 3.0 2.0
In [40]:
_ = get_plot(v_to_count, "likert")

Section: 2

Group of question: currentEmp

What type of organisation do you work for?

In [41]:
v_to_count, filename  = get_count(df, ['currentEmp1. What type of organisation do you work for?'], "one choice", "./../survey_creation/uk_17/listAnswers/type_organisation.csv", "False", "./uk_2017/output/")
In [42]:
perc_to_count = get_percentage(v_to_count, filename)
In [43]:
display_side_by_side(v_to_count,perc_to_count)
What type of organisation do you work for?
University - within an academic group 128
National laboratory 41
University - within an IT service 21
University - within an HPC group 18
Government 17
Other 13
Private company 8
NGO 7
What type of organisation do you work for? [PERCENTAGE]
University - within an academic group 50.59
National laboratory 16.21
University - within an IT service 8.30
University - within an HPC group 7.11
Government 6.72
Other 5.14
Private company 3.16
NGO 2.77
In [44]:
_ = get_plot(perc_to_count, "one choice")

Which University?

In [45]:
v_to_count, filename  = get_count(df, ['currentEmp2. Which university?'], "one choice", "./../survey_creation/uk_17/listAnswers/universities.csv", "False", "./uk_2017/output/")
In [46]:
perc_to_count = get_percentage(v_to_count, filename)
In [47]:
display_side_by_side(v_to_count,perc_to_count)
Which university?
University of Edinburgh 23
University of Sheffield 14
University of Manchester 11
University of Oxford 11
University College London (UCL) 11
University of St Andrews 10
University of Cambridge 9
University of Southampton 8
University of Bristol 7
University of Leeds 7
Imperial College London 6
University of Birmingham 5
University of Warwick, Coventry 4
University of Dundee 3
Newcastle University 3
University of Reading 3
University of Bath 3
King's College London 2
University of Glasgow 2
Durham University, Durham and Stockton-on-Tees (Queen's Campus) 2
University of York 2
University of Exeter 2
Bournemouth University 2
Other 2
Royal Veterinary College 1
University of Leicester 1
Bangor University 1
University of Gloucestershire, Cheltenham, Gloucester and London 1
Heriot-Watt University, Edinburgh and Galashiels 1
University of Nottingham 1
Queen Mary, University of London 1
Swansea University 1
Cranfield University, Cranfield, and Shrivenham 1
University of Liverpool 1
St George's, University of London 1
University of Sussex, Falmer and Brighton 1
Birkbeck, University of London 1
NaN 88
Which university? [PERCENTAGE]
University of Edinburgh 13.94
University of Sheffield 8.48
University of Manchester 6.67
University of Oxford 6.67
University College London (UCL) 6.67
University of St Andrews 6.06
University of Cambridge 5.45
University of Southampton 4.85
University of Bristol 4.24
University of Leeds 4.24
Imperial College London 3.64
University of Birmingham 3.03
University of Warwick, Coventry 2.42
University of Dundee 1.82
Newcastle University 1.82
University of Reading 1.82
University of Bath 1.82
King's College London 1.21
University of Glasgow 1.21
Durham University, Durham and Stockton-on-Tees (Queen's Campus) 1.21
University of York 1.21
University of Exeter 1.21
Bournemouth University 1.21
Other 1.21
Royal Veterinary College 0.61
University of Leicester 0.61
Bangor University 0.61
University of Gloucestershire, Cheltenham, Gloucester and London 0.61
Heriot-Watt University, Edinburgh and Galashiels 0.61
University of Nottingham 0.61
Queen Mary, University of London 0.61
Swansea University 0.61
Cranfield University, Cranfield, and Shrivenham 0.61
University of Liverpool 0.61
St George's, University of London 0.61
University of Sussex, Falmer and Brighton 0.61
Birkbeck, University of London 0.61
In [48]:
_ = get_plot(perc_to_count, "one choice")

Which organisation do you work for?

In [49]:
 wc = wordcloud(df, ['currentEmp4. Which organisation do you work for?'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb91ba61d0>
In [50]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[50]:
(-0.5, 1499.5, 799.5, -0.5)

What is your official job title?

In [51]:
 wc = wordcloud(df, ['currentEmp5. What is your official job title'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb8c5d4c50>
In [52]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[52]:
(-0.5, 1499.5, 799.5, -0.5)

Are you known by a different job title? If so, please enter the job title you use

In [53]:
 wc = wordcloud(df, ['currentEmp6. Are you known in your group by a different job title? If so, please enter the job title you use'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb91e63630>
In [54]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[54]:
(-0.5, 1499.5, 799.5, -0.5)

Do you work full time or part time

In [55]:
v_to_count, filename  = get_count(df, ['currentEmp12. Do you work full time or part time?'], "one choice", "./../survey_creation/uk_17/listAnswers/type_hours.csv", "False", "./uk_2017/output/")
In [56]:
perc_to_count = get_percentage(v_to_count, filename)
In [57]:
display_side_by_side(v_to_count,perc_to_count)
Do you work full time or part time?
Full Time 242
Part-time 11
Do you work full time or part time? [PERCENTAGE]
Full Time 95.65
Part-time 4.35
In [58]:
_ = get_plot(perc_to_count, "one choice")

What type of contract are you employed on?

In [59]:
v_to_count, filename  = get_count(df, ['currentEmp10. What type of contract are you employed on?'], "one choice", "./../survey_creation/uk_17/listAnswers/type_contract.csv", "False", "./uk_2017/output/")
In [60]:
perc_to_count = get_percentage(v_to_count, filename)
In [61]:
display_side_by_side(v_to_count,perc_to_count)
What type of contract are you employed on?
Fixed term 92
Permanent -- funded by University core funding 77
Permanent -- as long as funding is available 65
Other/Not listed 15
Agency staff 3
Freelancer consultant contractor 1
What type of contract are you employed on? [PERCENTAGE]
Fixed term 36.36
Permanent -- funded by University core funding 30.43
Permanent -- as long as funding is available 25.69
Other/Not listed 5.93
Agency staff 1.19
Freelancer consultant contractor 0.40
In [62]:
_ = get_plot(perc_to_count, "one choice")

When did you start your current contract

What is the duration of your current contract?

In [63]:
v_to_count, filename  = get_count(df, ['currentEmp11. What is the duration of your current contract in months?'], "freenumeric", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [64]:
perc_to_count = get_percentage(v_to_count, filename)
In [65]:
display_side_by_side(v_to_count,perc_to_count)
What is the duration of your current contract in months?
0 NaN
1 60.0
2 48.0
3 NaN
4 NaN
5 NaN
6 18.0
7 NaN
8 24.0
9 NaN
10 36.0
11 NaN
12 NaN
13 NaN
14 48.0
15 NaN
16 NaN
17 60.0
18 60.0
19 NaN
20 NaN
21 NaN
22 NaN
23 NaN
24 NaN
25 36.0
26 29.0
27 30.0
28 36.0
29 NaN
30 36.0
31 NaN
32 12.0
33 NaN
34 NaN
35 48.0
36 24.0
37 NaN
38 NaN
39 36.0
40 NaN
41 24.0
42 NaN
43 60.0
44 24.0
45 36.0
46 12.0
47 NaN
48 NaN
49 NaN
50 24.0
51 6.0
52 NaN
53 NaN
54 8.0
55 NaN
56 36.0
57 43.0
58 NaN
59 12.0
60 36.0
61 NaN
62 1.5
63 NaN
64 NaN
65 48.0
66 NaN
67 NaN
68 33.0
69 NaN
70 36.0
71 NaN
72 NaN
73 NaN
74 NaN
75 NaN
76 NaN
77 48.0
78 36.0
79 NaN
80 NaN
81 12.0
82 NaN
83 NaN
84 NaN
85 NaN
86 36.0
87 NaN
88 36.0
89 NaN
90 NaN
91 NaN
92 NaN
93 15.0
94 NaN
95 10.0
96 NaN
97 NaN
98 12.0
99 NaN
100 48.0
101 36.0
102 12.0
103 NaN
104 NaN
105 NaN
106 NaN
107 NaN
108 21.0
109 NaN
110 48.0
111 NaN
112 NaN
113 36.0
114 6.0
115 36.0
116 24.0
117 NaN
118 36.0
119 NaN
120 NaN
121 NaN
122 NaN
123 NaN
124 24.0
125 NaN
126 12.0
127 36.0
128 16.0
129 36.0
130 NaN
131 NaN
132 36.0
133 36.0
134 48.0
135 NaN
136 NaN
137 NaN
138 39.0
139 24.0
140 34.0
141 18.0
142 NaN
143 NaN
144 NaN
145 36.0
146 NaN
147 NaN
148 24.0
149 NaN
150 NaN
151 NaN
152 NaN
153 48.0
154 NaN
155 NaN
156 NaN
157 12.0
158 NaN
159 NaN
160 NaN
161 NaN
162 12.0
163 NaN
164 NaN
165 NaN
166 NaN
167 NaN
168 60.0
169 24.0
170 NaN
171 NaN
172 NaN
173 60.0
174 NaN
175 NaN
176 NaN
177 36.0
178 40.0
179 NaN
180 NaN
181 NaN
182 NaN
183 6.0
184 NaN
185 26.0
186 18.0
187 NaN
188 NaN
189 NaN
190 NaN
191 48.0
192 NaN
193 45.0
194 NaN
195 NaN
196 NaN
197 NaN
198 60.0
199 72.0
200 NaN
201 NaN
202 NaN
203 10.0
204 NaN
205 12.0
206 NaN
207 24.0
208 NaN
209 NaN
210 NaN
211 36.0
212 36.0
213 24.0
214 36.0
215 NaN
216 NaN
217 NaN
218 NaN
219 NaN
220 12.0
221 NaN
222 NaN
223 NaN
224 NaN
225 NaN
226 NaN
227 NaN
228 NaN
229 NaN
230 NaN
231 NaN
232 NaN
233 NaN
234 NaN
235 NaN
236 NaN
237 24.0
238 NaN
239 NaN
240 36.0
241 36.0
242 NaN
243 NaN
244 27.0
245 NaN
246 NaN
247 NaN
248 NaN
249 NaN
250 NaN
251 NaN
252 NaN
What is the duration of your current contract in months? [PERCENTAGE]
0 NaN
1 NaN
2 NaN
3 NaN
4 NaN
5 NaN
6 NaN
7 NaN
8 NaN
9 NaN
10 NaN
11 NaN
12 NaN
13 NaN
14 NaN
15 NaN
16 NaN
17 NaN
18 NaN
19 NaN
20 NaN
21 NaN
22 NaN
23 NaN
24 NaN
25 NaN
26 NaN
27 NaN
28 NaN
29 NaN
30 NaN
31 NaN
32 NaN
33 NaN
34 NaN
35 NaN
36 NaN
37 NaN
38 NaN
39 NaN
40 NaN
41 NaN
42 NaN
43 NaN
44 NaN
45 NaN
46 NaN
47 NaN
48 NaN
49 NaN
50 NaN
51 NaN
52 NaN
53 NaN
54 NaN
55 NaN
56 NaN
57 NaN
58 NaN
59 NaN
60 NaN
61 NaN
62 NaN
63 NaN
64 NaN
65 NaN
66 NaN
67 NaN
68 NaN
69 NaN
70 NaN
71 NaN
72 NaN
73 NaN
74 NaN
75 NaN
76 NaN
77 NaN
78 NaN
79 NaN
80 NaN
81 NaN
82 NaN
83 NaN
84 NaN
85 NaN
86 NaN
87 NaN
88 NaN
89 NaN
90 NaN
91 NaN
92 NaN
93 NaN
94 NaN
95 NaN
96 NaN
97 NaN
98 NaN
99 NaN
100 NaN
101 NaN
102 NaN
103 NaN
104 NaN
105 NaN
106 NaN
107 NaN
108 NaN
109 NaN
110 NaN
111 NaN
112 NaN
113 NaN
114 NaN
115 NaN
116 NaN
117 NaN
118 NaN
119 NaN
120 NaN
121 NaN
122 NaN
123 NaN
124 NaN
125 NaN
126 NaN
127 NaN
128 NaN
129 NaN
130 NaN
131 NaN
132 NaN
133 NaN
134 NaN
135 NaN
136 NaN
137 NaN
138 NaN
139 NaN
140 NaN
141 NaN
142 NaN
143 NaN
144 NaN
145 NaN
146 NaN
147 NaN
148 NaN
149 NaN
150 NaN
151 NaN
152 NaN
153 NaN
154 NaN
155 NaN
156 NaN
157 NaN
158 NaN
159 NaN
160 NaN
161 NaN
162 NaN
163 NaN
164 NaN
165 NaN
166 NaN
167 NaN
168 NaN
169 NaN
170 NaN
171 NaN
172 NaN
173 NaN
174 NaN
175 NaN
176 NaN
177 NaN
178 NaN
179 NaN
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 NaN
186 NaN
187 NaN
188 NaN
189 NaN
190 NaN
191 NaN
192 NaN
193 NaN
194 NaN
195 NaN
196 NaN
197 NaN
198 NaN
199 NaN
200 NaN
201 NaN
202 NaN
203 NaN
204 NaN
205 NaN
206 NaN
207 NaN
208 NaN
209 NaN
210 NaN
211 NaN
212 NaN
213 NaN
214 NaN
215 NaN
216 NaN
217 NaN
218 NaN
219 NaN
220 NaN
221 NaN
222 NaN
223 NaN
224 NaN
225 NaN
226 NaN
227 NaN
228 NaN
229 NaN
230 NaN
231 NaN
232 NaN
233 NaN
234 NaN
235 NaN
236 NaN
237 NaN
238 NaN
239 NaN
240 NaN
241 NaN
242 NaN
243 NaN
244 NaN
245 NaN
246 NaN
247 NaN
248 NaN
249 NaN
250 NaN
251 NaN
252 NaN
In [66]:
_ = get_plot(v_to_count, "freenumeric")
       What is the duration of your current contract in months?
count                                          92.000000       
mean                                           31.385870       
std                                            15.376431       
min                                             1.500000       
25%                                            20.250000       
50%                                            36.000000       
75%                                            36.000000       
max                                            72.000000       

In which disciplines do you work (select as many as apply)

In [67]:
v_to_count, filename  = get_count(df, ['currentEmp13[SQ001]. In which disciplines do you work (select as many as apply)? [Accounting & Finance]', 'currentEmp13[SQ002]. In which disciplines do you work (select as many as apply)? [Aeronautical & Manufacturing Engineering]', 'currentEmp13[SQ003]. In which disciplines do you work (select as many as apply)? [Agriculture & Forestry]', 'currentEmp13[SQ004]. In which disciplines do you work (select as many as apply)? [American Studies]', 'currentEmp13[SQ005]. In which disciplines do you work (select as many as apply)? [Anatomy & Physiology]', 'currentEmp13[SQ006]. In which disciplines do you work (select as many as apply)? [Anthropology]', 'currentEmp13[SQ007]. In which disciplines do you work (select as many as apply)? [Architecture]', 'currentEmp13[SQ008]. In which disciplines do you work (select as many as apply)? [Art & Design]', 'currentEmp13[SQ009]. In which disciplines do you work (select as many as apply)? [Biological Sciences]', 'currentEmp13[SQ010]. In which disciplines do you work (select as many as apply)? [Business & Management Studies]', 'currentEmp13[SQ011]. In which disciplines do you work (select as many as apply)? [Chemical Engineering]', 'currentEmp13[SQ012]. In which disciplines do you work (select as many as apply)? [Chemistry]', 'currentEmp13[SQ013]. In which disciplines do you work (select as many as apply)? [Civil Engineering]', 'currentEmp13[SQ014]. In which disciplines do you work (select as many as apply)? [Classics & Ancient History]', 'currentEmp13[SQ015]. In which disciplines do you work (select as many as apply)? [Communication & Media Studies]', 'currentEmp13[SQ016]. In which disciplines do you work (select as many as apply)? [Complementary Medicine]', 'currentEmp13[SQ017]. In which disciplines do you work (select as many as apply)? [Computer Science]', 'currentEmp13[SQ018]. In which disciplines do you work (select as many as apply)? [Counselling]', 'currentEmp13[SQ019]. In which disciplines do you work (select as many as apply)? [Criminology]', 'currentEmp13[SQ020]. In which disciplines do you work (select as many as apply)? [Dentistry]', 'currentEmp13[SQ021]. In which disciplines do you work (select as many as apply)? [East & South Asian Studies]', 'currentEmp13[SQ022]. In which disciplines do you work (select as many as apply)? [Economics]', 'currentEmp13[SQ023]. In which disciplines do you work (select as many as apply)? [Education]', 'currentEmp13[SQ024]. In which disciplines do you work (select as many as apply)? [Electrical & Electronic Engineering]', 'currentEmp13[SQ025]. In which disciplines do you work (select as many as apply)? [English]', 'currentEmp13[SQ026]. In which disciplines do you work (select as many as apply)? [Fashion]', 'currentEmp13[SQ027]. In which disciplines do you work (select as many as apply)? [Food Science]', 'currentEmp13[SQ028]. In which disciplines do you work (select as many as apply)? [French]', 'currentEmp13[SQ029]. In which disciplines do you work (select as many as apply)? [Geography & Environmental Sciences]', 'currentEmp13[SQ030]. In which disciplines do you work (select as many as apply)? [Geology]', 'currentEmp13[SQ031]. In which disciplines do you work (select as many as apply)? [General Engineering]', 'currentEmp13[SQ032]. In which disciplines do you work (select as many as apply)? [German]', 'currentEmp13[SQ033]. In which disciplines do you work (select as many as apply)? [History]', 'currentEmp13[SQ034]. In which disciplines do you work (select as many as apply)? [History of Art, Architecture & Design]', 'currentEmp13[SQ035]. In which disciplines do you work (select as many as apply)? [Hospitality, Leisure, Recreation & Tourism]', 'currentEmp13[SQ036]. In which disciplines do you work (select as many as apply)? [Iberian Languages/Hispanic Studies]', 'currentEmp13[SQ037]. In which disciplines do you work (select as many as apply)? [Land & Property Management]', 'currentEmp13[SQ038]. In which disciplines do you work (select as many as apply)? [Law]', 'currentEmp13[SQ039]. In which disciplines do you work (select as many as apply)? [Librarianship & Information Management]', 'currentEmp13[SQ040]. In which disciplines do you work (select as many as apply)? [Linguistics]', 'currentEmp13[SQ041]. In which disciplines do you work (select as many as apply)? [Marketing]', 'currentEmp13[SQ042]. In which disciplines do you work (select as many as apply)? [Materials Technology]', 'currentEmp13[SQ043]. In which disciplines do you work (select as many as apply)? [Mathematics]', 'currentEmp13[SQ044]. In which disciplines do you work (select as many as apply)? [Mechanical Engineering]', 'currentEmp13[SQ045]. In which disciplines do you work (select as many as apply)? [Medicine]', 'currentEmp13[SQ046]. In which disciplines do you work (select as many as apply)? [Middle Eastern and African Studies]', 'currentEmp13[SQ047]. In which disciplines do you work (select as many as apply)? [Music]', 'currentEmp13[SQ048]. In which disciplines do you work (select as many as apply)? [Nursing]', 'currentEmp13[SQ049]. In which disciplines do you work (select as many as apply)? [Ophthalmics]', 'currentEmp13[SQ050]. In which disciplines do you work (select as many as apply)? [Pharmacology & Pharmacy]', 'currentEmp13[SQ051]. In which disciplines do you work (select as many as apply)? [Philosophy]', 'currentEmp13[SQ052]. In which disciplines do you work (select as many as apply)? [Physics and Astronomy]', 'currentEmp13[SQ053]. In which disciplines do you work (select as many as apply)? [Physiotherapy]', 'currentEmp13[SQ054]. In which disciplines do you work (select as many as apply)? [Politics]', 'currentEmp13[SQ055]. In which disciplines do you work (select as many as apply)? [Psychology]', 'currentEmp13[SQ056]. In which disciplines do you work (select as many as apply)? [Robotics]', 'currentEmp13[SQ057]. In which disciplines do you work (select as many as apply)? [Russian & East European Languages]', 'currentEmp13[SQ058]. In which disciplines do you work (select as many as apply)? [Social Policy]', 'currentEmp13[SQ059]. In which disciplines do you work (select as many as apply)? [Social Work]', 'currentEmp13[SQ060]. In which disciplines do you work (select as many as apply)? [Sociology]', 'currentEmp13[SQ061]. In which disciplines do you work (select as many as apply)? [Sports Science]', 'currentEmp13[SQ062]. In which disciplines do you work (select as many as apply)? [Theology & Religious Studies]', 'currentEmp13[SQ063]. In which disciplines do you work (select as many as apply)? [Town & Country Planning and Landscape Design]', 'currentEmp13[SQ064]. In which disciplines do you work (select as many as apply)? [Veterinary Medicine]', 'currentEmp13[SQ065]. In which disciplines do you work (select as many as apply)? [Youth Work]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/academic_field.csv", "False", "./uk_2017/output/")
In [68]:
perc_to_count = get_percentage(v_to_count, filename)
In [69]:
display_side_by_side(v_to_count,perc_to_count)
Count
Computer Science 112
Biological Sciences 87
Physics and Astronomy 80
Mathematics 42
Chemistry 39
Medicine 37
Geography & Environmental Sciences 34
Electrical & Electronic Engineering 25
Mechanical Engineering 24
General Engineering 20
Materials Technology 17
Education 15
Psychology 12
Chemical Engineering 12
Civil Engineering 12
Aeronautical & Manufacturing Engineering 11
Geology 11
Linguistics 11
Anatomy & Physiology 11
Librarianship & Information Management 9
Robotics 9
History 8
Architecture 8
Sociology 8
Economics 8
Classics & Ancient History 7
Social Policy 7
Business & Management Studies 7
Pharmacology & Pharmacy 6
Agriculture & Forestry 6
Middle Eastern and African Studies 5
Anthropology 5
Ophthalmics 4
Food Science 4
Theology & Religious Studies 4
Town & Country Planning and Landscape Design 4
Communication & Media Studies 4
Art & Design 4
History of Art, Architecture & Design 3
Law 3
Nursing 3
Politics 2
Land & Property Management 2
Philosophy 2
English 2
Physiotherapy 1
Social Work 1
Sports Science 1
Veterinary Medicine 1
Youth Work 1
Music 1
Marketing 1
Iberian Languages/Hispanic Studies 1
Hospitality, Leisure, Recreation & Tourism 1
German 1
French 1
Fashion 1
East & South Asian Studies 1
Dentistry 1
Criminology 1
Counselling 1
Complementary Medicine 1
American Studies 1
Russian & East European Languages 0
Accounting & Finance 0
Count [PERCENTAGE]
Computer Science 14.66
Biological Sciences 11.39
Physics and Astronomy 10.47
Mathematics 5.50
Chemistry 5.10
Medicine 4.84
Geography & Environmental Sciences 4.45
Electrical & Electronic Engineering 3.27
Mechanical Engineering 3.14
General Engineering 2.62
Materials Technology 2.23
Education 1.96
Psychology 1.57
Chemical Engineering 1.57
Civil Engineering 1.57
Aeronautical & Manufacturing Engineering 1.44
Geology 1.44
Linguistics 1.44
Anatomy & Physiology 1.44
Librarianship & Information Management 1.18
Robotics 1.18
History 1.05
Architecture 1.05
Sociology 1.05
Economics 1.05
Classics & Ancient History 0.92
Social Policy 0.92
Business & Management Studies 0.92
Pharmacology & Pharmacy 0.79
Agriculture & Forestry 0.79
Middle Eastern and African Studies 0.65
Anthropology 0.65
Ophthalmics 0.52
Food Science 0.52
Theology & Religious Studies 0.52
Town & Country Planning and Landscape Design 0.52
Communication & Media Studies 0.52
Art & Design 0.52
History of Art, Architecture & Design 0.39
Law 0.39
Nursing 0.39
Politics 0.26
Land & Property Management 0.26
Philosophy 0.26
English 0.26
Physiotherapy 0.13
Social Work 0.13
Sports Science 0.13
Veterinary Medicine 0.13
Youth Work 0.13
Music 0.13
Marketing 0.13
Iberian Languages/Hispanic Studies 0.13
Hospitality, Leisure, Recreation & Tourism 0.13
German 0.13
French 0.13
Fashion 0.13
East & South Asian Studies 0.13
Dentistry 0.13
Criminology 0.13
Counselling 0.13
Complementary Medicine 0.13
American Studies 0.13
Russian & East European Languages 0.00
Accounting & Finance 0.00
In [70]:
_ = get_plot(perc_to_count, "multiple choices")

Section: 3

Group of question: prevEmp

Where was your previous job based?

In [71]:
v_to_count, filename  = get_count(df, ['prevEmp1. Where was your previous job based?'], "one choice", "./../survey_creation/uk_17/listAnswers/type_organisation.csv", "False", "./uk_2017/output/")
In [72]:
perc_to_count = get_percentage(v_to_count, filename)
In [73]:
display_side_by_side(v_to_count,perc_to_count)
Where was your previous job based?
University 121
Private company 64
This is my first job 27
National laboratory 16
Government 8
Other 8
NGO 2
NaN 7
Where was your previous job based? [PERCENTAGE]
University 49.19
Private company 26.02
This is my first job 10.98
National laboratory 6.50
Government 3.25
Other 3.25
NGO 0.81
In [74]:
_ = get_plot(perc_to_count, "one choice")

Rank the following factors dependent on how strongly they influenced your decision to accept your current position

In [75]:
v_to_count, filename  = get_count(df, ['prevEmp2[1]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 1]', 'prevEmp2[2]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 2]', 'prevEmp2[3]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 3]', 'prevEmp2[4]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 4]', 'prevEmp2[5]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 5]', 'prevEmp2[6]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 6]', 'prevEmp2[7]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 7]', 'prevEmp2[8]. Rank the following factors dependent on how strongly they influenced your decision to accept your current position [Rank 8]'], "ranking", "./../survey_creation/uk_17/listAnswers/reason_leave_job.csv", "False", "./uk_2017/output/")
In [76]:
perc_to_count = get_percentage(v_to_count, filename)
In [77]:
display_side_by_side(v_to_count,perc_to_count)
Rank 1 Rank 2 Rank 3 Rank 4 Rank 5 Rank 6 Rank 7 Rank 8
Desire to work in a research environment 55 39 38 30 15 6 5 2.0
Desire to advance research 38 38 41 21 24 15 9 NaN
Opportunity to develop software 32 33 26 25 24 10 12 10.0
I want to learn new skills 27 28 23 31 23 20 11 6.0
Freedom to choose own working practices 23 19 28 22 21 18 20 6.0
Opportunity for career advancement 20 23 16 14 14 18 16 17.0
Ability to work across disciplines 15 31 27 21 17 21 6 8.0
Flexible working hours 14 14 13 27 23 19 16 23.0
The salary 9 5 9 15 13 16 27 20.0
NaN 20 23 32 47 79 110 131 161.0
Rank 1 Rank 2 Rank 3 Rank 4 Rank 5 Rank 6 Rank 7 Rank 8
Desire to work in a research environment 28.95 20.53 20.00 15.79 7.89 3.16 2.63 1.05
Desire to advance research 20.43 20.43 22.04 11.29 12.90 8.06 4.84 NaN
Opportunity to develop software 18.60 19.19 15.12 14.53 13.95 5.81 6.98 5.81
I want to learn new skills 15.98 16.57 13.61 18.34 13.61 11.83 6.51 3.55
Freedom to choose own working practices 14.65 12.10 17.83 14.01 13.38 11.46 12.74 3.82
Opportunity for career advancement 14.49 16.67 11.59 10.14 10.14 13.04 11.59 12.32
Ability to work across disciplines 10.27 21.23 18.49 14.38 11.64 14.38 4.11 5.48
Flexible working hours 9.40 9.40 8.72 18.12 15.44 12.75 10.74 15.44
The salary 7.89 4.39 7.89 13.16 11.40 14.04 23.68 17.54
In [78]:
_ = get_plot(perc_to_count, "ranking")

Section: 4

Group of question: currentWork

Do you always work with the same researcher(s), or do you regularly change the researcher(s) you work with?

In [79]:
v_to_count, filename  = get_count(df, ['currentWork1. Do you always work with the same researcher(s), or do you regularly change the researcher(s) you work with?'], "one choice", "./../survey_creation/uk_17/listAnswers/work_researchers.csv", "False", "./uk_2017/output/")
In [80]:
perc_to_count = get_percentage(v_to_count, filename)
In [81]:
display_side_by_side(v_to_count,perc_to_count)
Do you always work with the same researcher(s), or do you regularly change the researcher(s) you work with?
Regularly change researcher(s) 122
Same researcher(s) 113
NaN 18
Do you always work with the same researcher(s), or do you regularly change the researcher(s) you work with? [PERCENTAGE]
Regularly change researcher(s) 51.91
Same researcher(s) 48.09
In [82]:
_ = get_plot(perc_to_count, "one choice")

Do you work for a Research Software Group?

In [83]:
v_to_count, filename  = get_count(df, ['currentWork2. Do you work for a Research Software Group?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [84]:
perc_to_count = get_percentage(v_to_count, filename)
In [85]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Do you work for a Research Software Group? 59 58 136
Yes No
Do you work for a Research Software Group? [PERCENTAGE] 50.43 49.57
In [86]:
_ = get_plot(perc_to_count, "y/n/na")

Group of question: paper

Has your software contributed to research that has been published in a journal or at a conference?; In general, when your software contributes to a paper, are you acknowledged in that paper?; Are you generally named as the main author of the paper?; Are you generally named as a co-author of the paper?; Are you generally acknowledged in the main text of the paper?

In [87]:
v_to_count, filename  = get_count(df, ['paper1. Has your software contributed to research that has been published in a journal or at a conference?', 'paper2. In general, when your software contributes to a paper, are you acknowledged in that paper?', 'paper3. Are you generally named as the main author of the paper?', 'paper4. Are you generally named as a co-author of the paper?', 'paper5. Are you generally acknowledged in the main text of the paper?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [88]:
perc_to_count = get_percentage(v_to_count, filename)
In [89]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
paper1. Has your software contributed to research that has been published in a journal or at a conference? 199 20 34
paper2. In general, when your software contributes to a paper, are you acknowledged in that paper? 139 40 74
paper3. Are you generally named as the main author of the paper? 49 86 118
paper4. Are you generally named as a co-author of the paper? 76 12 165
paper5. Are you generally acknowledged in the main text of the paper? 11 1 241
Yes No
paper1. Has your software contributed to research that has been published in a journal or at a conference? 90.87 9.13
paper2. In general, when your software contributes to a paper, are you acknowledged in that paper? 77.65 22.35
paper3. Are you generally named as the main author of the paper? 36.30 63.70
paper4. Are you generally named as a co-author of the paper? 86.36 13.64
paper5. Are you generally acknowledged in the main text of the paper? 91.67 8.33
In [90]:
_ = get_plot(perc_to_count, "y/n/na")

Group of question: conf

Have you ever presented your software work at a conference or workshop?

In [91]:
v_to_count, filename  = get_count(df, ['conf1can. Have you ever presented your software work at a conference or workshop?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [92]:
perc_to_count = get_percentage(v_to_count, filename)
In [93]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Have you ever presented your software work at a conference or workshop? 134 101 18
Yes No
Have you ever presented your software work at a conference or workshop? [PERCENTAGE] 57.02 42.98
In [94]:
_ = get_plot(perc_to_count, "y/n/na")

Which conference(s)/workshop(s)

In [95]:
 wc = wordcloud(df, ['conf2can. Which conferences or workshops?'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb91bddda0>
In [96]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[96]:
(-0.5, 1499.5, 799.5, -0.5)

Group of question: proj

How many software projects are you currently involved in?

In [97]:
v_to_count, filename  = get_count(df, ['proj1can. How many software projects are you currently involved with?'], "freenumeric", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [98]:
perc_to_count = get_percentage(v_to_count, filename)
In [99]:
display_side_by_side(v_to_count,perc_to_count)
How many software projects are you currently involved with?
0 3.0
1 5.0
2 NaN
3 5.0
4 5.0
5 4.0
6 3.0
7 2.0
8 2.0
9 2.0
10 NaN
11 5.0
12 2.0
13 1.0
14 1.0
15 3.0
16 5.0
17 2.0
18 6.0
19 3.0
20 1.0
21 2.0
22 4.0
23 3.0
24 5.0
25 2.0
26 2.0
27 3.0
28 3.0
29 5.0
30 4.0
31 4.0
32 NaN
33 1.0
34 5.0
35 4.0
36 3.0
37 2.0
38 3.0
39 3.0
40 1.0
41 2.0
42 NaN
43 3.0
44 2.0
45 2.0
46 15.0
47 3.0
48 4.0
49 NaN
50 4.0
51 1.0
52 3.0
53 3.0
54 2.0
55 2.0
56 1.0
57 1.0
58 1.0
59 3.0
60 2.0
61 2.0
62 NaN
63 NaN
64 1.0
65 4.0
66 1.0
67 5.0
68 3.0
69 4.0
70 2.0
71 1.0
72 6.0
73 1.0
74 5.0
75 5.0
76 NaN
77 NaN
78 NaN
79 2.0
80 10.0
81 2.0
82 3.0
83 4.0
84 3.0
85 7.0
86 4.0
87 10.0
88 NaN
89 10.0
90 8.0
91 1.0
92 3.0
93 NaN
94 NaN
95 3.0
96 3.0
97 10.0
98 2.0
99 2.0
100 5.0
101 3.0
102 5.0
103 20.0
104 2.0
105 1.0
106 6.0
107 6.0
108 3.0
109 NaN
110 NaN
111 3.0
112 8.0
113 1.0
114 NaN
115 1.0
116 3.0
117 2.0
118 4.0
119 0.0
120 3.0
121 NaN
122 3.0
123 3.0
124 4.0
125 8.0
126 2.0
127 4.0
128 1.0
129 5.0
130 4.0
131 5.0
132 5.0
133 NaN
134 1.0
135 10.0
136 4.0
137 2.0
138 3.0
139 3.0
140 3.0
141 6.0
142 2.0
143 2.0
144 3.0
145 10.0
146 2.0
147 2.0
148 50.0
149 3.0
150 2.0
151 3.0
152 4.0
153 2.0
154 4.0
155 20.0
156 5.0
157 3.0
158 NaN
159 6.0
160 5.0
161 NaN
162 4.0
163 12.0
164 4.0
165 5.0
166 1.0
167 4.0
168 NaN
169 4.0
170 2.0
171 1.0
172 2.0
173 10.0
174 2.0
175 1.0
176 NaN
177 2.0
178 3.0
179 5.0
180 4.0
181 3.0
182 NaN
183 NaN
184 NaN
185 3.0
186 10.0
187 2.0
188 4.0
189 5.0
190 5.0
191 2.0
192 3.0
193 1.0
194 6.0
195 0.0
196 NaN
197 2.0
198 5.0
199 3.0
200 4.0
201 2.0
202 4.0
203 NaN
204 4.0
205 1.0
206 4.0
207 5.0
208 3.0
209 3.0
210 2.0
211 2.0
212 5.0
213 4.0
214 1.0
215 3.0
216 2.0
217 11.0
218 7.0
219 NaN
220 2.0
221 3.0
222 4.0
223 6.0
224 3.0
225 2.0
226 4.0
227 3.0
228 1.0
229 3.0
230 7.0
231 3.0
232 3.0
233 NaN
234 15.0
235 3.0
236 2.0
237 1.0
238 NaN
239 5.0
240 1.0
241 NaN
242 3.0
243 2.0
244 3.0
245 2.0
246 3.0
247 4.0
248 3.0
249 5.0
250 3.0
251 8.0
252 5.0
How many software projects are you currently involved with? [PERCENTAGE]
0 NaN
1 NaN
2 NaN
3 NaN
4 NaN
5 NaN
6 NaN
7 NaN
8 NaN
9 NaN
10 NaN
11 NaN
12 NaN
13 NaN
14 NaN
15 NaN
16 NaN
17 NaN
18 NaN
19 NaN
20 NaN
21 NaN
22 NaN
23 NaN
24 NaN
25 NaN
26 NaN
27 NaN
28 NaN
29 NaN
30 NaN
31 NaN
32 NaN
33 NaN
34 NaN
35 NaN
36 NaN
37 NaN
38 NaN
39 NaN
40 NaN
41 NaN
42 NaN
43 NaN
44 NaN
45 NaN
46 NaN
47 NaN
48 NaN
49 NaN
50 NaN
51 NaN
52 NaN
53 NaN
54 NaN
55 NaN
56 NaN
57 NaN
58 NaN
59 NaN
60 NaN
61 NaN
62 NaN
63 NaN
64 NaN
65 NaN
66 NaN
67 NaN
68 NaN
69 NaN
70 NaN
71 NaN
72 NaN
73 NaN
74 NaN
75 NaN
76 NaN
77 NaN
78 NaN
79 NaN
80 NaN
81 NaN
82 NaN
83 NaN
84 NaN
85 NaN
86 NaN
87 NaN
88 NaN
89 NaN
90 NaN
91 NaN
92 NaN
93 NaN
94 NaN
95 NaN
96 NaN
97 NaN
98 NaN
99 NaN
100 NaN
101 NaN
102 NaN
103 NaN
104 NaN
105 NaN
106 NaN
107 NaN
108 NaN
109 NaN
110 NaN
111 NaN
112 NaN
113 NaN
114 NaN
115 NaN
116 NaN
117 NaN
118 NaN
119 NaN
120 NaN
121 NaN
122 NaN
123 NaN
124 NaN
125 NaN
126 NaN
127 NaN
128 NaN
129 NaN
130 NaN
131 NaN
132 NaN
133 NaN
134 NaN
135 NaN
136 NaN
137 NaN
138 NaN
139 NaN
140 NaN
141 NaN
142 NaN
143 NaN
144 NaN
145 NaN
146 NaN
147 NaN
148 NaN
149 NaN
150 NaN
151 NaN
152 NaN
153 NaN
154 NaN
155 NaN
156 NaN
157 NaN
158 NaN
159 NaN
160 NaN
161 NaN
162 NaN
163 NaN
164 NaN
165 NaN
166 NaN
167 NaN
168 NaN
169 NaN
170 NaN
171 NaN
172 NaN
173 NaN
174 NaN
175 NaN
176 NaN
177 NaN
178 NaN
179 NaN
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 NaN
186 NaN
187 NaN
188 NaN
189 NaN
190 NaN
191 NaN
192 NaN
193 NaN
194 NaN
195 NaN
196 NaN
197 NaN
198 NaN
199 NaN
200 NaN
201 NaN
202 NaN
203 NaN
204 NaN
205 NaN
206 NaN
207 NaN
208 NaN
209 NaN
210 NaN
211 NaN
212 NaN
213 NaN
214 NaN
215 NaN
216 NaN
217 NaN
218 NaN
219 NaN
220 NaN
221 NaN
222 NaN
223 NaN
224 NaN
225 NaN
226 NaN
227 NaN
228 NaN
229 NaN
230 NaN
231 NaN
232 NaN
233 NaN
234 NaN
235 NaN
236 NaN
237 NaN
238 NaN
239 NaN
240 NaN
241 NaN
242 NaN
243 NaN
244 NaN
245 NaN
246 NaN
247 NaN
248 NaN
249 NaN
250 NaN
251 NaN
252 NaN
In [100]:
_ = get_plot(v_to_count, "freenumeric")
       How many software projects are you currently involved with?
count                                         222.000000          
mean                                            3.959459          
std                                             4.226952          
min                                             0.000000          
25%                                             2.000000          
50%                                             3.000000          
75%                                             5.000000          
max                                            50.000000          

In general, what sort of testing do you conduct on your software? (check all that apply)

In [101]:
v_to_count, filename  = get_count(df, ['proj4can[SQ001]. In general, what sort of testing do you conduct on your software? (check all that apply) [No formal testing]', 'proj4can[SQ002]. In general, what sort of testing do you conduct on your software? (check all that apply) [Developers conduct testing]', 'proj4can[SQ003]. In general, what sort of testing do you conduct on your software? (check all that apply) [Test engineers conduct testing]', 'proj4can[SQ004]. In general, what sort of testing do you conduct on your software? (check all that apply) [Users conduct testing]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/testing.csv", "False", "./uk_2017/output/")
In [102]:
perc_to_count = get_percentage(v_to_count, filename)
In [103]:
display_side_by_side(v_to_count,perc_to_count)
Count
Developers conduct testing 202
Users conduct testing 99
No formal testing 37
Test engineers conduct testing 18
Count [PERCENTAGE]
Developers conduct testing 56.74
Users conduct testing 27.81
No formal testing 10.39
Test engineers conduct testing 5.06
In [104]:
_ = get_plot(perc_to_count, "multiple choices")

Group of question: stability

What is the bus factor of your most important software project?

In [105]:
v_to_count, filename  = get_count(df, ['stability1. What is the bus factor of your most important software project?'], "likert", "./../survey_creation/uk_17/listAnswers/bus_factor.csv", "True", "./uk_2017/output/")
In [106]:
display(v_to_count) 
1 2 3 4 5+
What is the bus factor of your most important software project? 99 72 29 8 25
In [107]:
_ = get_plot(v_to_count, "likert")

Is there a technical hand-over plan for your most important software project?

In [108]:
v_to_count, filename  = get_count(df, ['stability2. Is there a technical handover plan for your most important software project?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [109]:
perc_to_count = get_percentage(v_to_count, filename)
In [110]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Is there a technical handover plan for your most important software project? 61 173 19
Yes No
Is there a technical handover plan for your most important software project? [PERCENTAGE] 26.07 73.93
In [111]:
_ = get_plot(perc_to_count, "y/n/na")

Group of question: open

Have you ever released your software under an open-source licence?

In [112]:
v_to_count, filename  = get_count(df, ['open01can. Have you ever released your software under an open-source licence?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [113]:
perc_to_count = get_percentage(v_to_count, filename)
In [114]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Have you ever released your software under an open-source licence? 158 76 19
Yes No
Have you ever released your software under an open-source licence? [PERCENTAGE] 67.52 32.48
In [115]:
_ = get_plot(perc_to_count, "y/n/na")

How often do you release the software projects you've worked on under an open-source licence?

In [116]:
v_to_count, filename  = get_count(df, ["open1can[A5]. How often do you release the software projects you've worked on under an open-source licence? []"], "likert", "./../survey_creation/uk_17/listAnswers/likert_time_10.csv", "False", "./uk_2017/output/")
In [117]:
display(v_to_count) 
How often do you release the software projects you've worked on under an open-source licence? []
In [118]:
_ = get_plot(v_to_count, "likert")

Have you ever used a Digital Object Identifier (DOI) to identify your software?

In [119]:
v_to_count, filename  = get_count(df, ['open03can. Have you ever used a Digital Object Identifier (DOI) to identify your software?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [120]:
perc_to_count = get_percentage(v_to_count, filename)
In [121]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Have you ever used a Digital Object Identifier (DOI) to identify your software? 52 182 19
Yes No
Have you ever used a Digital Object Identifier (DOI) to identify your software? [PERCENTAGE] 22.22 77.78
In [122]:
_ = get_plot(perc_to_count, "y/n/na")

How often do you associate your software with a Digital Object Identifier (DOI)?

In [123]:
v_to_count, filename  = get_count(df, ['open3can[SQ001]. How often do you associate your software with a Digital Object Identifier (DOI)? []'], "likert", "./../survey_creation/uk_17/listAnswers/likert_time_10.csv", "False", "./uk_2017/output/")
In [124]:
display(v_to_count) 
How often do you associate your software with a Digital Object Identifier (DOI)? []
In [125]:
_ = get_plot(v_to_count, "likert")

Group of question: train

Have you ever trained researchers in computational techniques?

In [126]:
v_to_count, filename  = get_count(df, ['train1. Have you ever trained researchers in computational techniques?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [127]:
perc_to_count = get_percentage(v_to_count, filename)
In [128]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Have you ever trained researchers in computational techniques? 152 82 19
Yes No
Have you ever trained researchers in computational techniques? [PERCENTAGE] 64.96 35.04
In [129]:
_ = get_plot(perc_to_count, "y/n/na")

On average, how many times a year do you take part in providing training?

In [130]:
v_to_count, filename  = get_count(df, ['train2. On average, how many times a year do you take part in providing training?'], "freenumeric", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [131]:
perc_to_count = get_percentage(v_to_count, filename)
In [132]:
display_side_by_side(v_to_count,perc_to_count)
On average, how many times a year do you take part in providing training?
0 NaN
1 NaN
2 NaN
3 4.0
4 NaN
5 NaN
6 NaN
7 NaN
8 2.0
9 NaN
10 15.0
11 NaN
12 2.0
13 3.0
14 NaN
15 NaN
16 2.0
17 NaN
18 1.0
19 3.0
20 2.0
21 NaN
22 NaN
23 3.0
24 NaN
25 3.0
26 NaN
27 2.0
28 NaN
29 NaN
30 NaN
31 1.0
32 NaN
33 2.0
34 2.0
35 1.0
36 NaN
37 NaN
38 NaN
39 2.0
40 2.0
41 4.0
42 NaN
43 1.0
44 NaN
45 2.0
46 1.0
47 2.0
48 4.0
49 NaN
50 5.0
51 NaN
52 2.0
53 NaN
54 NaN
55 5.0
56 2.0
57 3.0
58 NaN
59 3.0
60 2.0
61 4.0
62 NaN
63 NaN
64 3.0
65 NaN
66 3.0
67 2.0
68 NaN
69 NaN
70 2.0
71 2.0
72 2.0
73 NaN
74 2.0
75 2.0
76 NaN
77 NaN
78 NaN
79 1.0
80 10.0
81 2.0
82 NaN
83 NaN
84 NaN
85 NaN
86 NaN
87 0.0
88 3.0
89 3.0
90 0.0
91 4.0
92 0.0
93 NaN
94 NaN
95 1.0
96 2.0
97 2.0
98 2.0
99 2.0
100 3.0
101 1.0
102 2.0
103 4.0
104 6.0
105 3.0
106 4.0
107 NaN
108 NaN
109 NaN
110 NaN
111 3.0
112 5.0
113 2.0
114 1.0
115 2.0
116 2.0
117 2.0
118 NaN
119 15.0
120 1.0
121 NaN
122 2.0
123 2.0
124 30.0
125 6.0
126 NaN
127 1.0
128 NaN
129 1.0
130 2.0
131 1.0
132 3.0
133 1.0
134 NaN
135 NaN
136 1.0
137 NaN
138 6.0
139 4.0
140 2.0
141 6.0
142 NaN
143 NaN
144 3.0
145 NaN
146 2.0
147 NaN
148 NaN
149 NaN
150 3.0
151 1.0
152 2.0
153 NaN
154 NaN
155 0.0
156 4.0
157 NaN
158 1.0
159 NaN
160 2.0
161 NaN
162 NaN
163 24.0
164 4.0
165 NaN
166 NaN
167 3.0
168 NaN
169 6.0
170 4.0
171 NaN
172 5.0
173 3.0
174 1.0
175 5.0
176 NaN
177 2.0
178 2.0
179 2.0
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 4.0
186 3.0
187 3.0
188 7.0
189 NaN
190 NaN
191 0.0
192 NaN
193 1.0
194 2.0
195 5.0
196 NaN
197 3.0
198 NaN
199 16.0
200 NaN
201 NaN
202 NaN
203 NaN
204 6.0
205 4.0
206 NaN
207 NaN
208 NaN
209 10.0
210 3.0
211 3.0
212 12.0
213 1.0
214 NaN
215 NaN
216 4.0
217 8.0
218 4.0
219 NaN
220 NaN
221 3.0
222 3.0
223 NaN
224 3.0
225 NaN
226 NaN
227 2.0
228 1.0
229 2.0
230 2.0
231 1.0
232 NaN
233 NaN
234 3.0
235 NaN
236 0.0
237 NaN
238 NaN
239 2.0
240 NaN
241 NaN
242 1.0
243 NaN
244 1.0
245 1.0
246 3.0
247 20.0
248 2.0
249 NaN
250 3.0
251 2.0
252 2.0
On average, how many times a year do you take part in providing training? [PERCENTAGE]
0 NaN
1 NaN
2 NaN
3 NaN
4 NaN
5 NaN
6 NaN
7 NaN
8 NaN
9 NaN
10 NaN
11 NaN
12 NaN
13 NaN
14 NaN
15 NaN
16 NaN
17 NaN
18 NaN
19 NaN
20 NaN
21 NaN
22 NaN
23 NaN
24 NaN
25 NaN
26 NaN
27 NaN
28 NaN
29 NaN
30 NaN
31 NaN
32 NaN
33 NaN
34 NaN
35 NaN
36 NaN
37 NaN
38 NaN
39 NaN
40 NaN
41 NaN
42 NaN
43 NaN
44 NaN
45 NaN
46 NaN
47 NaN
48 NaN
49 NaN
50 NaN
51 NaN
52 NaN
53 NaN
54 NaN
55 NaN
56 NaN
57 NaN
58 NaN
59 NaN
60 NaN
61 NaN
62 NaN
63 NaN
64 NaN
65 NaN
66 NaN
67 NaN
68 NaN
69 NaN
70 NaN
71 NaN
72 NaN
73 NaN
74 NaN
75 NaN
76 NaN
77 NaN
78 NaN
79 NaN
80 NaN
81 NaN
82 NaN
83 NaN
84 NaN
85 NaN
86 NaN
87 NaN
88 NaN
89 NaN
90 NaN
91 NaN
92 NaN
93 NaN
94 NaN
95 NaN
96 NaN
97 NaN
98 NaN
99 NaN
100 NaN
101 NaN
102 NaN
103 NaN
104 NaN
105 NaN
106 NaN
107 NaN
108 NaN
109 NaN
110 NaN
111 NaN
112 NaN
113 NaN
114 NaN
115 NaN
116 NaN
117 NaN
118 NaN
119 NaN
120 NaN
121 NaN
122 NaN
123 NaN
124 NaN
125 NaN
126 NaN
127 NaN
128 NaN
129 NaN
130 NaN
131 NaN
132 NaN
133 NaN
134 NaN
135 NaN
136 NaN
137 NaN
138 NaN
139 NaN
140 NaN
141 NaN
142 NaN
143 NaN
144 NaN
145 NaN
146 NaN
147 NaN
148 NaN
149 NaN
150 NaN
151 NaN
152 NaN
153 NaN
154 NaN
155 NaN
156 NaN
157 NaN
158 NaN
159 NaN
160 NaN
161 NaN
162 NaN
163 NaN
164 NaN
165 NaN
166 NaN
167 NaN
168 NaN
169 NaN
170 NaN
171 NaN
172 NaN
173 NaN
174 NaN
175 NaN
176 NaN
177 NaN
178 NaN
179 NaN
180 NaN
181 NaN
182 NaN
183 NaN
184 NaN
185 NaN
186 NaN
187 NaN
188 NaN
189 NaN
190 NaN
191 NaN
192 NaN
193 NaN
194 NaN
195 NaN
196 NaN
197 NaN
198 NaN
199 NaN
200 NaN
201 NaN
202 NaN
203 NaN
204 NaN
205 NaN
206 NaN
207 NaN
208 NaN
209 NaN
210 NaN
211 NaN
212 NaN
213 NaN
214 NaN
215 NaN
216 NaN
217 NaN
218 NaN
219 NaN
220 NaN
221 NaN
222 NaN
223 NaN
224 NaN
225 NaN
226 NaN
227 NaN
228 NaN
229 NaN
230 NaN
231 NaN
232 NaN
233 NaN
234 NaN
235 NaN
236 NaN
237 NaN
238 NaN
239 NaN
240 NaN
241 NaN
242 NaN
243 NaN
244 NaN
245 NaN
246 NaN
247 NaN
248 NaN
249 NaN
250 NaN
251 NaN
252 NaN
In [133]:
_ = get_plot(v_to_count, "freenumeric")
       On average, how many times a year do you take part in providing training?
count                                         146.000000                        
mean                                            3.431507                        
std                                             4.071613                        
min                                             0.000000                        
25%                                             2.000000                        
50%                                             2.000000                        
75%                                             4.000000                        
max                                            30.000000                        

What training programs are you involved with (comma separated list) (For example, Software Carpentry, local university training, etc.)

In [134]:
 wc = wordcloud(df, ['train3. What training programs are you involved with (comma separated list)?'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb918df860>
In [135]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[135]:
(-0.5, 1499.5, 799.5, -0.5)

Group of question: fund

Do you know the source of the funding used to support you and your current, largest project?

In [136]:
v_to_count, filename  = get_count(df, ['fund1. Do you know the source of the funding used to support you and your current, largest project?'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [137]:
perc_to_count = get_percentage(v_to_count, filename)
In [138]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Do you know the source of the funding used to support you and your current, largest project? 199 23 31
Yes No
Do you know the source of the funding used to support you and your current, largest project? [PERCENTAGE] 89.64 10.36
In [139]:
_ = get_plot(perc_to_count, "y/n/na")

Which of the following sources are used to pay for your effort as an RSE/equivalent?

In [140]:
v_to_count, filename  = get_count(df, ['fund2[SQ001]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [I volunteer my time]', 'fund2[SQ002]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Donation button]', 'fund2[SQ003]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Crowdfunding (one-time)]', 'fund2[SQ004]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Crowdfunding (recurring)]', 'fund2[SQ005]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Books & merchandise]', 'fund2[SQ006]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Advertising & sponsorships]', 'fund2[SQ007]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Industry support]', 'fund2[SQ008]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Consulting & services]', 'fund2[SQ009]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Grants]', 'fund2[SQ010]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [SaaS]', 'fund2[SQ011]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Membership]', 'fund2[SQ012]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Dual license]', 'fund2[SQ013]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Open core]', 'fund2[SQ014]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Foundations & consortiums]', 'fund2[SQ015]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Venture capital]', 'fund2[SQ016]. Which of the following sources are used to pay for your effort as an RSE/equivalent? [Trademark licensing & franchising]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/funding.csv", "False", "./uk_2017/output/")
In [141]:
perc_to_count = get_percentage(v_to_count, filename)
In [142]:
display_side_by_side(v_to_count,perc_to_count)
Count
Grants 163
Industry support 33
I volunteer my time 27
Consulting & services 17
Open core 7
Foundations & consortiums 6
Trademark licensing & franchising 3
SaaS 2
Membership 2
Dual license 2
Crowdfunding (one-time) 1
Books & merchandise 1
Advertising & sponsorships 1
Venture capital 1
Donation button 0
Crowdfunding (recurring) 0
Count [PERCENTAGE]
Grants 61.28
Industry support 12.41
I volunteer my time 10.15
Consulting & services 6.39
Open core 2.63
Foundations & consortiums 2.26
Trademark licensing & franchising 1.13
SaaS 0.75
Membership 0.75
Dual license 0.75
Crowdfunding (one-time) 0.38
Books & merchandise 0.38
Advertising & sponsorships 0.38
Venture capital 0.38
Donation button 0.00
Crowdfunding (recurring) 0.00
In [143]:
_ = get_plot(perc_to_count, "multiple choices")

Which of the following sources are used to fund your current, largest project?

In [144]:
v_to_count, filename  = get_count(df, ['fund3[SQ001]. Which of the following sources are used to fund your current, largest project? [Volunteers]', 'fund3[SQ002]. Which of the following sources are used to fund your current, largest project? [Donation button]', 'fund3[SQ003]. Which of the following sources are used to fund your current, largest project? [Crowdfunding (one-time)]', 'fund3[SQ004]. Which of the following sources are used to fund your current, largest project? [Crowdfunding (recurring)]', 'fund3[SQ005]. Which of the following sources are used to fund your current, largest project? [Books & merchandise]', 'fund3[SQ006]. Which of the following sources are used to fund your current, largest project? [Advertising & sponsorships]', 'fund3[SQ007]. Which of the following sources are used to fund your current, largest project? [Industry support]', 'fund3[SQ008]. Which of the following sources are used to fund your current, largest project? [Consulting & services]', 'fund3[SQ009]. Which of the following sources are used to fund your current, largest project? [Grants]', 'fund3[SQ010]. Which of the following sources are used to fund your current, largest project? [SaaS]', 'fund3[SQ011]. Which of the following sources are used to fund your current, largest project? [Membership]', 'fund3[SQ012]. Which of the following sources are used to fund your current, largest project? [Dual license]', 'fund3[SQ013]. Which of the following sources are used to fund your current, largest project? [Open core]', 'fund3[SQ014]. Which of the following sources are used to fund your current, largest project? [Foundations & consortiums]', 'fund3[SQ015]. Which of the following sources are used to fund your current, largest project? [Venture capital]', 'fund3[SQ016]. Which of the following sources are used to fund your current, largest project? [Trademark licensing & franchising]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/funding.csv", "False", "./uk_2017/output/")
In [145]:
perc_to_count = get_percentage(v_to_count, filename)
In [146]:
display_side_by_side(v_to_count,perc_to_count)
Count
Grants 157
Industry support 30
Volunteers 15
Consulting & services 8
Foundations & consortiums 5
Membership 2
Dual license 2
Open core 2
Trademark licensing & franchising 2
Donation button 1
Crowdfunding (one-time) 1
SaaS 1
Venture capital 1
Crowdfunding (recurring) 0
Books & merchandise 0
Advertising & sponsorships 0
Count [PERCENTAGE]
Grants 69.16
Industry support 13.22
Volunteers 6.61
Consulting & services 3.52
Foundations & consortiums 2.20
Membership 0.88
Dual license 0.88
Open core 0.88
Trademark licensing & franchising 0.88
Donation button 0.44
Crowdfunding (one-time) 0.44
SaaS 0.44
Venture capital 0.44
Crowdfunding (recurring) 0.00
Books & merchandise 0.00
Advertising & sponsorships 0.00
In [147]:
_ = get_plot(perc_to_count, "multiple choices")

Which of the following sources were used to fund your current, largest project three years ago?

In [148]:
v_to_count, filename  = get_count(df, ['fund4[SQ017]. Which of the following sources were used to fund your current, largest project three years ago? [My previous project is less than 3 years old]', 'fund4[SQ001]. Which of the following sources were used to fund your current, largest project three years ago? [Volunteers]', 'fund4[SQ002]. Which of the following sources were used to fund your current, largest project three years ago? [Donation button]', 'fund4[SQ003]. Which of the following sources were used to fund your current, largest project three years ago? [Crowdfunding (one-time)]', 'fund4[SQ004]. Which of the following sources were used to fund your current, largest project three years ago? [Crowdfunding (recurring)]', 'fund4[SQ005]. Which of the following sources were used to fund your current, largest project three years ago? [Books & merchandise]', 'fund4[SQ006]. Which of the following sources were used to fund your current, largest project three years ago? [Advertising & sponsorships]', 'fund4[SQ007]. Which of the following sources were used to fund your current, largest project three years ago? [Industry support]', 'fund4[SQ008]. Which of the following sources were used to fund your current, largest project three years ago? [Consulting & services]', 'fund4[SQ009]. Which of the following sources were used to fund your current, largest project three years ago? [Grants]', 'fund4[SQ010]. Which of the following sources were used to fund your current, largest project three years ago? [SaaS]', 'fund4[SQ011]. Which of the following sources were used to fund your current, largest project three years ago? [Membership]', 'fund4[SQ012]. Which of the following sources were used to fund your current, largest project three years ago? [Dual license]', 'fund4[SQ013]. Which of the following sources were used to fund your current, largest project three years ago? [Open core]', 'fund4[SQ014]. Which of the following sources were used to fund your current, largest project three years ago? [Foundations & consortiums]', 'fund4[SQ015]. Which of the following sources were used to fund your current, largest project three years ago? [Venture capital]', 'fund4[SQ016]. Which of the following sources were used to fund your current, largest project three years ago? [Trademark licensing & franchising]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/funding.csv", "False", "./uk_2017/output/")
In [149]:
perc_to_count = get_percentage(v_to_count, filename)
In [150]:
display_side_by_side(v_to_count,perc_to_count)
Count
Grants 127
My previous project is less than 3 years old 24
Industry support 23
Volunteers 10
Consulting & services 6
Dual license 2
Open core 2
Advertising & sponsorships 1
SaaS 1
Foundations & consortiums 1
Venture capital 1
Trademark licensing & franchising 1
Books & merchandise 0
Crowdfunding (recurring) 0
Crowdfunding (one-time) 0
Donation button 0
Membership 0
Count [PERCENTAGE]
Grants 63.82
My previous project is less than 3 years old 12.06
Industry support 11.56
Volunteers 5.03
Consulting & services 3.02
Dual license 1.01
Open core 1.01
Advertising & sponsorships 0.50
SaaS 0.50
Foundations & consortiums 0.50
Venture capital 0.50
Trademark licensing & franchising 0.50
Books & merchandise 0.00
Crowdfunding (recurring) 0.00
Crowdfunding (one-time) 0.00
Donation button 0.00
Membership 0.00
In [151]:
_ = get_plot(perc_to_count, "multiple choices")

Section: 5

Group of question: perfCheck

Do you receive sufficient information on the results of your work?; Does your work give you the opportunity to check on how well you are doing your work?; In your work, do you have access to sufficient data and information?; Do you receive sufficient information on the purpose of your work?; Does your work provide you with direct feedback on how well you are doing your work?; Does your supervisor/line manager inform you about how well you are doing your work?; Do your colleagues inform you about how well you are doing your work?

In [152]:
v_to_count, filename  = get_count(df, ['likerttime1[perfCheck1]. Please rate the following propositions. There are no right or wrong answers [Do you receive sufficient information on the results of your work?]', 'likerttime1[perfCheck2]. Please rate the following propositions. There are no right or wrong answers [Do you get the opportunity to check on how well you are doing your work?]', 'likerttime1[perfCheck3]. Please rate the following propositions. There are no right or wrong answers [Do you have access to sufficient data and information to do your work?]', 'likertime2[perfCheck4]. Please rate the following propositions. There are no right or wrong answers [Do you receive sufficient information on the purpose of your work?]', 'likertime2[perfCheck5]. Please rate the following propositions. There are no right or wrong answers [Does your work provide you with direct feedback on how well you are doing?]', 'likertime2[perfCheck6]. Please rate the following propositions. There are no right or wrong answers [Does your supervisor/line manager inform you about how well you are doing your work?]', 'likerttime1[perfCheck7]. Please rate the following propositions. There are no right or wrong answers [Do your colleagues inform you about how well you are doing your work?]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_time_5.csv", "False", "./uk_2017/output/")
In [153]:
display(v_to_count) 
Never Sometimes Often Very Often Always
Do you receive sufficient information on the results of your work? 5 74 62 51 21
Do you get the opportunity to check on how well you are doing your work? 17 90 68 30 8
Do you have access to sufficient data and information to do your work? 1 44 62 75 40
Do you receive sufficient information on the purpose of your work? 6 38 73 50 47
Does your work provide you with direct feedback on how well you are doing? 15 115 41 29 13
Does your supervisor/line manager inform you about how well you are doing your work? 13 97 53 28 16
Do your colleagues inform you about how well you are doing your work? 20 110 48 32 7
In [154]:
_ = get_plot(v_to_count, "likert")

Group of question: turnOver

In [155]:
v_to_count, filename  = get_count(df, ['likertime2[turnOver1]. Please rate the following propositions. There are no right or wrong answers [How often do you feel frustrated when not given the opportunity to achieve your personal work-related goals?]', 'likertime2[turnOver2]. Please rate the following propositions. There are no right or wrong answers [How often do you look forward to another day at work?]', 'likerttime1[turnOver3]. Please rate the following propositions. There are no right or wrong answers [How often do you consider leaving your job?]', 'likerttime1[turnOver4]. Please rate the following propositions. There are no right or wrong answers [How often do you dream about getting another job that will better suit your needs?]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_time_5.csv", "False", "./uk_2017/output/")
In [156]:
display(v_to_count) 
Never Sometimes Often Very Often Always
How often do you feel frustrated when not given the opportunity to achieve your personal work-related goals? 32 98 41 24 12
How often do you look forward to another day at work? 7 72 74 43 19
How often do you consider leaving your job? 45 111 30 24 9
How often do you dream about getting another job that will better suit your needs? 43 99 23 34 19
In [157]:
_ = get_plot(v_to_count, "likert")

My current job satisfies my personal needs; I would accept another job at the same compensation level if I was offered it

In [158]:
v_to_count, filename  = get_count(df, ['likertagree1[turnOver5]. Please rate the following propositions. There are no right or wrong answers [My current job satisfies my personal needs]', 'likertagree3[turnOver6]. Please rate the following propositions. There are no right or wrong answers [I would accept another job at the same compensation level if I was offered it]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_agree.csv", "False", "./uk_2017/output/")
In [159]:
display(v_to_count) 
Strongly disagree Disagree Neither agree or disagree Agree Strongly Agree
My current job satisfies my personal needs 9 26 36 113 37
I would accept another job at the same compensation level if I was offered it 27 64 72 33 13
In [160]:
_ = get_plot(v_to_count, "likert")

Group of question: affRec

I am satisfied with my supervisor/line manager's confidence in me; I am satisfied with a word of thanks from my supervisor/line manager; I am satisfied with the recognition I receive from my supervisor/line manager for doing my job; I am satisfied with the compliments from my supervisor/line manager concerning my work; I am satisfied with the encouragement from my supervisor/line manager while doing my job

In [161]:
v_to_count, filename  = get_count(df, ["likertagree3[affRec1]. Please rate the following propositions. There are no right or wrong answers [I am satisfied with my supervisor/line manager's confidence in me]", 'likertagree2[affRec2]. Please rate the following propositions. There are no right or wrong answers [I am satisfied with a word of thanks from my supervisor/line manager]', 'likertagree2[affRec3]. Please rate the following propositions. There are no right or wrong answers [I am satisfied with the recognition I receive from my supervisor/line manager for doing my job]', 'likertagree1[affRec4]. Please rate the following propositions. There are no right or wrong answers [I am satisfied with the compliments from my supervisor/line manager concerning my work]', 'likertagree3[affRec5]. Please rate the following propositions. There are no right or wrong answers [I am satisfied with the encouragement from my supervisor/line manager while doing my job]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_agree.csv", "False", "./uk_2017/output/")
In [162]:
display(v_to_count) 
Strongly disagree Disagree Neither agree or disagree Agree Strongly Agree
I am satisfied with my supervisor/line manager's confidence in me 6 13 30 105 56
I am satisfied with a word of thanks from my supervisor/line manager 8 22 58 89 37
I am satisfied with the recognition I receive from my supervisor/line manager for doing my job 9 31 37 90 51
I am satisfied with the compliments from my supervisor/line manager concerning my work 8 37 43 101 19
I am satisfied with the encouragement from my supervisor/line manager while doing my job 9 33 46 87 34
In [163]:
_ = get_plot(v_to_count, "likert")

Group of question: percEmp

It would not be very difficult for me to get an equivalent job in a different organisation; I can think of a number of organisations that would probably offer me a job; My experience is in demand on the labour market; Given my qualifications and experience, getting a new job would not be very hard at all

In [164]:
v_to_count, filename  = get_count(df, ['likertagree1[percEmp1]. Please rate the following propositions. There are no right or wrong answers [It would not be very difficult for me to get an equivalent job in a different organisation]', 'likertagree2[percEmp2]. Please rate the following propositions. There are no right or wrong answers [I can think of a number of organisations that would probably offer me a job]', 'likertagree1[percEmp3]. Please rate the following propositions. There are no right or wrong answers [My experience is in demand on the labour market]', 'likertagree3[percEmp4]. Please rate the following propositions. There are no right or wrong answers [Given my qualifications and experience, getting a new job would not be very hard at all]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_agree.csv", "False", "./uk_2017/output/")
In [165]:
display(v_to_count) 
Strongly disagree Disagree Neither agree or disagree Agree Strongly Agree
It would not be very difficult for me to get an equivalent job in a different organisation 11 39 64 61 36
I can think of a number of organisations that would probably offer me a job 2 14 35 101 69
My experience is in demand on the labour market 3 13 37 101 60
Given my qualifications and experience, getting a new job would not be very hard at all 8 20 66 75 44
In [166]:
_ = get_plot(v_to_count, "likert")

Group of question: affSat

I find real enjoyment in my job; Most days I am enthusiastic about my job; I feel fairly well satisfied with my job; I like my job better than the average person

In [167]:
v_to_count, filename  = get_count(df, ['likertagree2[affSat1]. Please rate the following propositions. There are no right or wrong answers [I find real enjoyment in my job]', 'likertagree2[affSat2]. Please rate the following propositions. There are no right or wrong answers [Most days I am enthusiastic about my job]', 'likertagree1[affSat3]. Please rate the following propositions. There are no right or wrong answers [I feel satisfied with my job]', 'likertagree3[affSat4]. Please rate the following propositions. There are no right or wrong answers [I like my job more than average]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_agree.csv", "False", "./uk_2017/output/")
In [168]:
display(v_to_count) 
Strongly disagree Disagree Neither agree or disagree Agree Strongly Agree
I find real enjoyment in my job 4 10 20 95 96
Most days I am enthusiastic about my job 5 16 37 115 50
I feel satisfied with my job 9 26 40 111 29
I like my job more than average 7 10 41 101 55
In [169]:
_ = get_plot(v_to_count, "likert")

Group of question: satisGen

In general, how satisfied are you with Your current position

In [170]:
v_to_count, filename  = get_count(df, ['satisGen1[SQ001]. In general, how satisfied are you with: [Your current position]', 'satisGen1[SQ002]. In general, how satisfied are you with: [Your career]'], "likert", "./../survey_creation/uk_17/listAnswers/likert_satisfied_10.csv", "False", "./uk_2017/output/")
In [171]:
display(v_to_count) 
0 - Not at all satisfied 1 2 3 4 5 6 7 8 9 10 - Completely satisfied
Your current position 7 4 6 4 12 18 32 61 44 26 5
Your career 4 4 7 14 19 23 37 55 41 12 4
In [172]:
_ = get_plot(v_to_count, "likert")

Section: 6

Group of question: socio

Please select your gender

In [173]:
v_to_count, filename  = get_count(df, ['socio2. Please select your gender'], "one choice", "./../survey_creation/uk_17/listAnswers/gender.csv", "False", "./uk_2017/output/")
In [174]:
perc_to_count = get_percentage(v_to_count, filename)
In [175]:
display_side_by_side(v_to_count,perc_to_count)
Please select your gender
Male 183
Female 33
Other 1
NaN 36
Please select your gender [PERCENTAGE]
Male 84.33
Female 15.21
Other 0.46
In [176]:
_ = get_plot(perc_to_count, "one choice")

Please select your age

In [177]:
v_to_count, filename  = get_count(df, ['socio3. Please select your age'], "one choice", "./../survey_creation/uk_17/listAnswers/age.csv", "True", "./uk_2017/output/")
In [178]:
perc_to_count = get_percentage(v_to_count, filename)
In [179]:
display_side_by_side(v_to_count,perc_to_count)
Please select your age
18 to 24 years 88
25 to 34 years 78
35 to 44 years 38
45 to 54 years 33
55 to 64 years 12
NaN 4
Please select your age [PERCENTAGE]
18 to 24 years 35.34
25 to 34 years 31.33
35 to 44 years 15.26
45 to 54 years 13.25
55 to 64 years 4.82
In [180]:
_ = get_plot(perc_to_count, "one choice")

How would you describe your ethnic origin? This refers to people who share the same cultural background and identity, not country of birth or nationality.

In [181]:
v_to_count, filename  = get_count(df, ['socio5. How would you describe your ethnic origin? This refers to people who share the same cultural background and identity, not country of birth or nationality.'], "one choice", "./../survey_creation/uk_17/listAnswers/ethnicity.csv", "False", "./uk_2017/output/")
In [182]:
perc_to_count = get_percentage(v_to_count, filename)
In [183]:
display_side_by_side(v_to_count,perc_to_count)
How would you describe your ethnic origin? This refers to people who share the same cultural background and identity, not country of birth or nationality.
White: English / Welsh / Scottish / Northern Irish / British 136
White: Any other White background 57
Asian / Asian British: Chinese 5
White: Irish 3
Other Ethnic group: Any other ethnic group 3
Black / African / Caribbean / Black Britsh: African 2
Mixed / Multipe Ethnic groups: Any other Mixed / Multiple ethnic background, 2
Asian / Asian British: Indian 1
Mixed / Multipe Ethnic groups: White and Black Caribbean 1
NaN 43
How would you describe your ethnic origin? This refers to people who share the same cultural background and identity, not country of birth or nationality. [PERCENTAGE]
White: English / Welsh / Scottish / Northern Irish / British 64.76
White: Any other White background 27.14
Asian / Asian British: Chinese 2.38
White: Irish 1.43
Other Ethnic group: Any other ethnic group 1.43
Black / African / Caribbean / Black Britsh: African 0.95
Mixed / Multipe Ethnic groups: Any other Mixed / Multiple ethnic background, 0.95
Asian / Asian British: Indian 0.48
Mixed / Multipe Ethnic groups: White and Black Caribbean 0.48
In [184]:
_ = get_plot(perc_to_count, "one choice")

Please select the range of your salary

In [185]:
v_to_count, filename  = get_count(df, ['socio4. Please select the range of your salary'], "one choice", "./../survey_creation/uk_17/listAnswers/salary.csv", "True", "./uk_2017/output/")
In [186]:
perc_to_count = get_percentage(v_to_count, filename)
In [187]:
display_side_by_side(v_to_count,perc_to_count)
Please select the range of your salary
< £18,031 99
≥ £18,031 and < £24,057 56
≥ £24,057 and < £32,277 43
≥ £32,277 and < £43,325 36
≥ £43,325 and < £58,172 8
≥ £58,172 7
NaN 4
Please select the range of your salary [PERCENTAGE]
< £18,031 39.76
≥ £18,031 and < £24,057 22.49
≥ £24,057 and < £32,277 17.27
≥ £32,277 and < £43,325 14.46
≥ £43,325 and < £58,172 3.21
≥ £58,172 2.81
In [188]:
_ = get_plot(perc_to_count, "one choice")

Group of question: disa

Do you have a condition that is defined as a disability by the Equality Act 2010*

In [189]:
v_to_count, filename  = get_count(df, ['disa1. Do you have a condition that is defined as a disability by the Equality Act 2010*'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [190]:
perc_to_count = get_percentage(v_to_count, filename)
In [191]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Do you have a condition that is defined as a disability by the Equality Act 2010* 10 199 44
Yes No
Do you have a condition that is defined as a disability by the Equality Act 2010* [PERCENTAGE] 4.78 95.22
In [192]:
_ = get_plot(perc_to_count, "y/n/na")

Section: 7

Group of question: tool

What Operating System do you prefer to use at work?

In [193]:
v_to_count, filename  = get_count(df, ['tool2. What Operating System do you prefer to use at work?'], "one choice", "./../survey_creation/uk_17/listAnswers/os.csv", "False", "./uk_2017/output/")
In [194]:
perc_to_count = get_percentage(v_to_count, filename)
In [195]:
display_side_by_side(v_to_count,perc_to_count)
What Operating System do you prefer to use at work?
GNU/Linux 135
OS X 51
Windows 33
Other 2
NaN 32
What Operating System do you prefer to use at work? [PERCENTAGE]
GNU/Linux 61.09
OS X 23.08
Windows 14.93
Other 0.90
In [196]:
_ = get_plot(perc_to_count, "one choice")

What programming languages do you use at work? (Select as many as apply)

In [197]:
v_to_count, filename  = get_count(df, ['tool4can[SQ001]. What programming languages do you use at work? (Select as many as apply) [Assembly]', 'tool4can[SQ002]. What programming languages do you use at work? (Select as many as apply) [C]', 'tool4can[SQ003]. What programming languages do you use at work? (Select as many as apply) [C#]', 'tool4can[SQ004]. What programming languages do you use at work? (Select as many as apply) [C++]', 'tool4can[SQ005]. What programming languages do you use at work? (Select as many as apply) [Clojure]', 'tool4can[SQ006]. What programming languages do you use at work? (Select as many as apply) [CoffeeScript]', 'tool4can[SQ007]. What programming languages do you use at work? (Select as many as apply) [Common Lisp]', 'tool4can[SQ036]. What programming languages do you use at work? (Select as many as apply) [CUDA]', 'tool4can[SQ008]. What programming languages do you use at work? (Select as many as apply) [Dart]', 'tool4can[SQ009]. What programming languages do you use at work? (Select as many as apply) [Elixir]', 'tool4can[SQ010]. What programming languages do you use at work? (Select as many as apply) [Erlang]', 'tool4can[SQ011]. What programming languages do you use at work? (Select as many as apply) [F#]', 'tool4can[SQ037]. What programming languages do you use at work? (Select as many as apply) [FORTRAN]', 'tool4can[SQ012]. What programming languages do you use at work? (Select as many as apply) [Go]', 'tool4can[SQ013]. What programming languages do you use at work? (Select as many as apply) [Groovy]', 'tool4can[SQ014]. What programming languages do you use at work? (Select as many as apply) [Hack]', 'tool4can[SQ015]. What programming languages do you use at work? (Select as many as apply) [Haskell]', 'tool4can[SQ016]. What programming languages do you use at work? (Select as many as apply) [Java]', 'tool4can[SQ017]. What programming languages do you use at work? (Select as many as apply) [JavaScript]', 'tool4can[SQ018]. What programming languages do you use at work? (Select as many as apply) [Julia]', 'tool4can[SQ019]. What programming languages do you use at work? (Select as many as apply) [Lua]', 'tool4can[SQ038]. What programming languages do you use at work? (Select as many as apply) [Markup languages (HTML, markdown,...)]', 'tool4can[SQ020]. What programming languages do you use at work? (Select as many as apply) [Matlab]', 'tool4can[SQ021]. What programming languages do you use at work? (Select as many as apply) [Objective-C]', 'tool4can[SQ022]. What programming languages do you use at work? (Select as many as apply) [Perl]', 'tool4can[SQ023]. What programming languages do you use at work? (Select as many as apply) [PHP]', 'tool4can[SQ024]. What programming languages do you use at work? (Select as many as apply) [Python]', 'tool4can[SQ025]. What programming languages do you use at work? (Select as many as apply) [R]', 'tool4can[SQ026]. What programming languages do you use at work? (Select as many as apply) [Ruby]', 'tool4can[SQ027]. What programming languages do you use at work? (Select as many as apply) [Rust]', 'tool4can[SQ028]. What programming languages do you use at work? (Select as many as apply) [Scala]', 'tool4can[SQ029]. What programming languages do you use at work? (Select as many as apply) [Smalltalk]', 'tool4can[SQ030]. What programming languages do you use at work? (Select as many as apply) [SQL]', 'tool4can[SQ031]. What programming languages do you use at work? (Select as many as apply) [Swift]', 'tool4can[SQ039]. What programming languages do you use at work? (Select as many as apply) [Unix Shell Scripting]', 'tool4can[SQ032]. What programming languages do you use at work? (Select as many as apply) [TypeScript]', 'tool4can[SQ033]. What programming languages do you use at work? (Select as many as apply) [VB.NET]', 'tool4can[SQ034]. What programming languages do you use at work? (Select as many as apply) [VBA]', 'tool4can[SQ035]. What programming languages do you use at work? (Select as many as apply) [Visual Basic]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/programing_language.csv", "False", "./uk_2017/output/")
In [198]:
perc_to_count = get_percentage(v_to_count, filename)
In [199]:
display_side_by_side(v_to_count,perc_to_count)
Count
Python 189
Unix Shell Scripting 140
Markup languages (HTML, markdown,...) 113
C++ 94
C 92
JavaScript 85
FORTRAN 81
SQL 79
Java 65
Matlab 61
R 57
Perl 41
PHP 31
CUDA 21
C# 19
Ruby 14
TypeScript 9
Groovy 7
VBA 7
Visual Basic 7
Scala 6
Assembly 5
Go 5
Rust 3
F# 3
Elixir 2
CoffeeScript 2
Objective-C 2
Julia 2
Haskell 2
Erlang 1
Smalltalk 1
Swift 1
VB.NET 1
Clojure 1
Lua 1
Hack 0
Dart 0
Common Lisp 0
Count [PERCENTAGE]
Python 15.12
Unix Shell Scripting 11.20
Markup languages (HTML, markdown,...) 9.04
C++ 7.52
C 7.36
JavaScript 6.80
FORTRAN 6.48
SQL 6.32
Java 5.20
Matlab 4.88
R 4.56
Perl 3.28
PHP 2.48
CUDA 1.68
C# 1.52
Ruby 1.12
TypeScript 0.72
Groovy 0.56
VBA 0.56
Visual Basic 0.56
Scala 0.48
Assembly 0.40
Go 0.40
Rust 0.24
F# 0.24
Elixir 0.16
CoffeeScript 0.16
Objective-C 0.16
Julia 0.16
Haskell 0.16
Erlang 0.08
Smalltalk 0.08
Swift 0.08
VB.NET 0.08
Clojure 0.08
Lua 0.08
Hack 0.00
Dart 0.00
Common Lisp 0.00
In [200]:
_ = get_plot(perc_to_count, "multiple choices")

Group of question: ukrse

Are you a member of the UK RSE Association? (Members are people who have signed up to the UK RSE mailing list)

In [201]:
v_to_count, filename  = get_count(df, ['ukrse1. Are you a member of the UK RSE Association? (Members are people who have signed up to the UK RSE mailing list at www.rse.ac.uk)'], "y/n/na", "./../survey_creation/uk_17/listAnswers/.csv", "False", "./uk_2017/output/")
In [202]:
perc_to_count = get_percentage(v_to_count, filename)
In [203]:
display_side_by_side(v_to_count,perc_to_count)
Yes No nan
Are you a member of the UK RSE Association? (Members are people who have signed up to the UK RSE mailing list at www.rse.ac.uk) 156 62 35
Yes No
Are you a member of the UK RSE Association? (Members are people who have signed up to the UK RSE mailing list at www.rse.ac.uk) [PERCENTAGE] 71.56 28.44
In [204]:
_ = get_plot(perc_to_count, "y/n/na")

How do you meet other RSEs?

In [205]:
v_to_count, filename  = get_count(df, ['ukrse2[SQ001]. How do you meet other RSEs? [UK RSE Association]', 'ukrse2[SQ002]. How do you meet other RSEs? [Local RSE group/network]', 'ukrse2[SQ003]. How do you meet other RSEs? [N/A]'], "multiple choices", "./../survey_creation/uk_17/listAnswers/ukrse.csv", "False", "./uk_2017/output/")
In [206]:
perc_to_count = get_percentage(v_to_count, filename)
In [207]:
display_side_by_side(v_to_count,perc_to_count)
Count
N/A 101
UK RSE Association 72
Local RSE group/network 66
Count [PERCENTAGE]
N/A 42.26
UK RSE Association 30.13
Local RSE group/network 27.62
In [208]:
_ = get_plot(perc_to_count, "multiple choices")

How did you learn the skills you need to become an RSE?

In [209]:
 wc = wordcloud(df, ['ukrse3. How did you learn the skills you need to become an RSE?'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb8c611048>
In [210]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[210]:
(-0.5, 1499.5, 799.5, -0.5)

Group of question: skill

What three skills would you like to acquire or improve to help your work as a Research Software Engineer? The skills can be technical and non-technical.

In [211]:
 wc = wordcloud(df, ['skill2[SQ001]. What skills would you like to acquire or improve to help your work as a Research Software Engineer? The skills can be technical and non-technical. [Skill 1]', 'skill2[SQ2]. What skills would you like to acquire or improve to help your work as a Research Software Engineer? The skills can be technical and non-technical. [Skill 2]', 'skill2[SQ3]. What skills would you like to acquire or improve to help your work as a Research Software Engineer? The skills can be technical and non-technical. [Skill 3]'])
<wordcloud.wordcloud.WordCloud object at 0x7fdb8c622748>
In [212]:
 plt.imshow(wc, interpolation='bilinear')
 plt.axis("off")
Out[212]:
(-0.5, 1499.5, 799.5, -0.5)